Introduction

The genetic code of the last common ancestor (LCA), or a minor variant of it, is 
present in all species. Its origin, in the pre-LCA era, has remained an enigma for four 
decades. My analysis reveals that all regularities identified in code structure correlate 
strongly with path-distances in amino acid synthesis. The code accordingly evolved by 
adding amino acids as they appeared, during the growth of synthesis pathways 
outward from central metabolism. Codon assignments in the 'universal code' were 
found to derive from ancient transitional codes, formed deep within the pre-LCA era. 

Design of Study 

• Amino acid path-distances were evaluated as the number of reaction steps, from 
the citrate cycle, required for synthesis. 

• Distinct transition codes were identified and used to reconstruct code evolution. 

• The path-distance model was then validated by showing that it unifies over twenty 
diverse structural regularities identified in the genetic code and pre-LCA proteins. 

Amino Acid Synthesis Pathways 


Amino Acid and Codon Distribution on Synthesis Pathways 


• Amino acids form on ancient pathways catalyzed by fifty pre-LCA enzymes. 

• The twenty amino acids subdivide into four families with precursors OAA, αKG, Pyr, 
and PEP. 


Figure legend: 
Path-length contour 
[7] No. of pre-LCA enzymes in paths 
[CGN] Codons: N, any base; R, purine; Y, pyrimidine 
RCC Reductive citrate cycle 
RPC Reductive pentose cycle 
CT Central trunk 
Line styles: path with multianionic components; scaled amino acid synthesis path; 
unscaled segment of path; segment with alpha-amino acids 


Precursor-Product Amino Acid Pairs have Time-Ordered Codons 
• Extension of amino acid synthesis pathways and code formation were linked as: 

(i) Codons in precursor/product amino acid pairs exhibit time-ordered 5'- and mid-base. 

(ii) Mid-base and mean synthesis path-length (L) correlate strongly among short-path 
(2-7 steps) amino acids. 


Path-Distance Model Explains Code Structure 

1. Woese (1965): NAN column triplets …, while NUN triplets … 


Amino acids with short, medium, and long 
paths are chemically distinct and encoded 
differently: 

(1) Four NH4+ fixers form in 2 steps (or fewer), 
include both anionic residues (red), and 
have codons solely from the NAN column. 

(2) Ten amino acids with alkyl-, hydroxy-, and 
S-bearing side-chains form on 4-7 step 
paths. Consensus codons for 4-, 5-, and 
7-step residues are respectively from the 
NCN, NGN, and NUN columns. 7 of 8 'sets 
of four' (intact code boxes) encode them. 

(3) Six basic (blue) and aromatic residues 
form on 9-14 step paths. They are 
encoded mainly by codon doublets. 

A 14-fold exponential fall-off in codon 
assignments over paths of 4 to 14 steps 
conforms with gradual slowing in the tempo 
of evolution leading to the 'universal code'. 
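
The column assignments described above can be checked against the standard genetic code. The following Python sketch is illustrative only and is not part of the original analysis. It builds the standard codon table and lists the amino acids whose codons all share a single mid-base. The names CODE, columns_of, and solely_in_column are hypothetical helpers introduced here; the synthesis path lengths themselves are not encoded.

```python
from itertools import product

# Standard genetic code, codons ordered UUU, UUC, UUA, UUG, UCU, ...
# (first base varies slowest, third base fastest); '*' marks a stop codon.
BASES = "UCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODE = {"".join(codon): aa
        for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def columns_of(aa):
    """Return the set of mid-bases (code-table columns) used by one amino acid."""
    return {codon[1] for codon, a in CODE.items() if a == aa}


def solely_in_column(mid_base):
    """List amino acids (one-letter symbols) encoded only by N<mid_base>N codons."""
    return sorted(aa for aa in set(AMINO)
                  if aa != "*" and columns_of(aa) == {mid_base})


for mid in "ACGU":
    print(f"N{mid}N only:", solely_in_column(mid))
# NAN only: ['D', 'E', 'H', 'K', 'N', 'Q', 'Y']
# NCN only: ['A', 'P', 'T']
# NGN only: ['C', 'G', 'R', 'W']
# NUN only: ['F', 'I', 'L', 'M', 'V']
```

Both anionic residues (D and E) fall in the NAN-only group, in line with item (1). Serine does not appear in any group because its codons span the NCN and NGN columns.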
2. Taylor and Coates (1…): … intact boxes code for the … amino acids in proteins. 

[Plot axis: effective path length (L)] 
Transition Codes Revealed by Amino Acid Path-Distances 


3. Perlwitz et al. (1988): Codon mid-base has most coding capacity. 


• NAN column codons for four '2-step' amino acids (NH4+ fixers) and the STOP signal 
are identified as a vestige of the first code. Codons of amino acids with 2-, 4-, 5-, 
and 7-step paths exhibit the column order NAN → NCN → NGN → NUN, consistent with 
column-by-column growth of the code. 

• Early amino acids (2-7 step paths) acquired 7 of 8 intact boxes, suggesting each 
was allotted an intact box of codons on entering the code. 

• Latecomer amino acids (9-14 step paths) share 6 of 8 subdivided boxes with an 
early amino acid (2-7 step path), or stop signal. This indicates they captured codons 
from early amino acids. 
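
The counts of intact and subdivided boxes quoted above can be reproduced from the standard genetic code. The sketch below is an illustration under stated assumptions, not the method used in this study: a "box" is taken to be the four codons sharing the first two bases, and it is called intact when all four have the same meaning. The function name classify_boxes is hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered UUU, UUC, UUA, UUG, UCU, ...
# (first base varies slowest, third base fastest); '*' marks a stop codon.
BASES = "UCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODE = {"".join(codon): aa
        for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def classify_boxes():
    """Split the 16 codon boxes into intact (one meaning) and subdivided."""
    intact, subdivided = {}, {}
    for b1, b2 in product(BASES, repeat=2):
        prefix = b1 + b2
        meanings = {CODE[prefix + b3] for b3 in BASES}
        if len(meanings) == 1:
            intact[prefix + "N"] = meanings.pop()
        else:
            subdivided[prefix + "N"] = "".join(sorted(meanings))
    return intact, subdivided


intact, subdivided = classify_boxes()
print(len(intact), len(subdivided))  # 8 8
print(intact)
# {'UCN': 'S', 'CUN': 'L', 'CCN': 'P', 'CGN': 'R',
#  'ACN': 'T', 'GUN': 'V', 'GCN': 'A', 'GGN': 'G'}
print(subdivided)
```

Of the eight intact boxes, seven encode residues with alkyl, hydroxy, or related side-chains; the eighth, CGN, encodes arginine, a basic residue.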


[Figure axes: coding capacity per codon site; amino acids added per stage of code evolution] 
Conclusion 

Amino acid synthesis path length strongly influenced codon acquisition. 
Analysis of this relation reveals the genetic code evolved in three steps: 
(i) NH4+ Fixers Code, with NAN column triplets, (ii) Code Expansion, and 
(iii) Overprinting by latecomers. Not only is the genetic code universal, 
therefore, it conserves vestiges of the transition codes that shaped it. tRNA 
with joint cofactor (amino acid synthesis) and adaptor (translation) functions 
are implicated in linking code formation with pre-LCA path extension. 